Universal Access To All Knowledge
Home Donate | Store | Blog | FAQ | Jobs | Volunteer Positions | Contact | Bios | Forums | Projects | Terms, Privacy, & Copyright
Search: Advanced Search
Anonymous User (login or join us)
Upload

Reply to this post | Go Back
View Post [edit]

Poster: mellamokb Date: Apr 24, 2014 11:41am
Forum: web Subject: Zip Corrupted by Wayback Machine Banner

Hi,

I am trying to download a ZIP file that has disappeared from a website at the following url

http://intellecting.net/blog/file.axd?file=WCFProxyGenerator.zip

The zip file is linked by a blog post but now returns 404. So I found an older version on the WayBack Machine and attempted to download from archive.

https://web.archive.org/web/*/http://intellecting.net/blog/file.axd?file=WCFProxyGenerator.zip

This returns a zip file I can download, but it is reported as invalid when I attempt to open it. I tried the recommended solution (add 00 at the end with a hex editor) with no luck. However, when I browse with hex editor, I notice something very odd:

Offset(h) 00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F

00000230 51 EF BF BD EF BF BD 67 EF BF BD 47 1F 1D 06 2D Q��g�G...-
00000240 5F EF BF BD D5 BB 6B EF BF BD EF BF BD 7F 27 EF _�ջk��.'ï
00000250 BF BD 12 10 EF BF BD EF BF BD EF BF BD 7C EF BF ¿½..���|ï¿
00000260 BD EF BF BD 59 5B EF BF BD EF BF BD 49 45 EF BF ½ï¿½Y[��IEï¿
00000270 BD DE B5 EF BF BD 74 EF BF BD 5F EF BF BD EF BF ½Þµï¿½t�_�ï¿
00000280 BD EF BF BD 6A EF BF BD 45 33 EF BF BD 36 4D EF ½ï¿½j�E3�6Mï
00000290 BF BD 1D 05 74 52 EF BF BD 1C 06 7B EF BF BD EF ¿½..tR�..{�ï
000002A0 BF BD 04 30 75 36 05 1C EF BF BD 38 EF BF BD 0A ¿½.0u6..�8�.
000002B0 0A 3C 73 63 72 69 70 74 20 74 79 70 65 3D 22 74 .<script type="t
000002C0 65 78 74 2F 6A 61 76 61 73 63 72 69 70 74 22 20 ext/javascript"
000002D0 73 72 63 3D 22 2F 73 74 61 74 69 63 2F 6A 73 2F src="/static/js/
000002E0 61 6E 61 6C 79 74 69 63 73 2E 6A 73 22 20 3E 3C analytics.js" ><
000002F0 2F 73 63 72 69 70 74 3E 0A 3C 6C 69 6E 6B 20 74 /script>.<link t
00000300 79 70 65 3D 22 74 65 78 74 2F 63 73 73 22 20 72 ype="text/css" r
00000310 65 6C 3D 22 73 74 79 6C 65 73 68 65 65 74 22 20 el="stylesheet"
00000320 68 72 65 66 3D 22 2F 73 74 61 74 69 63 2F 63 73 href="/static/cs
00000330 73 2F 62 61 6E 6E 65 72 2D 73 74 79 6C 65 73 2E s/banner-styles.

It appears that the WayBack Machine banner has been inserted into the middle of the raw archive binary! I have tried removing it using a hex editor, but I can't tell where the exact boundary is that the real ZIP data ends and the inserted content begins. There is content inserted both here and at the end of the file that has to be removed. I've tried 5 or 6 variations leaving different number of 0A (line return) bytes in the original, but every time 7-ZIP sees the file as invalid archive.

Anyone have any idea how I can correct this ZIP file and open it? Does anyone know which exact byte offsets I will need to remove?

Thanks for your help!

~ mellamokb

This post was modified by mellamokb on 2014-04-24 18:41:41

Reply to this post
Reply [edit]

Poster: Administrator, Curator, or StaffDFJustin Date: May 27, 2014 6:26pm
Forum: web Subject: Re: Zip Corrupted by Wayback Machine Banner

There is a feature of the WayBack Machine allowing you to retrieve the unmodified file. Just add "id_" after the date code in the URL:

https://web.archive.org/web/20120329121151id_/http://intellecting.net/blog/file.axd?file=WCFProxyGenerator.zip

Terms of Use (10 Mar 2001)